A knowledge-based, automated method for phenotyping in the EHR using only clinical pathology reports
نویسندگان
چکیده
The secondary use of electronic health records (EHR) represents unprecedented opportunities for biomedical discovery. Central to this goal is, EHR-phenotyping, also known as cohort identification, which remains a significant challenge. Complex phenotypes often require multivariate and multi-scale analyses, ultimately leading to manually created phenotype definitions. We present Ontology-driven Reports-based Phenotyping from Unique Signatures (ORPheUS), an automated approach to EHR-phenotyping. To do this we identify unique signatures of abnormal clinical pathology reports that correspond to pre-defined medical terms from biomedical ontologies. By using only the clinical pathology, or "lab", reports we are able to mitigate clinical biases enabling researchers to explore other dimensions of the EHR. We used ORPheUS to generate signatures for 858 diseases and validated against reference cohorts for Type 2 Diabetes Mellitus (T2DM) and Atrial Fibrillation (AF). Our results suggest that our approach, using solely clinical pathology reports, is an effective as a primary screening tool for automated clinical phenotyping.
منابع مشابه
Toward high-throughput phenotyping: unbiased automated feature extraction and selection from knowledge sources
OBJECTIVE Analysis of narrative (text) data from electronic health records (EHRs) can improve population-scale phenotyping for clinical and genetic research. Currently, selection of text features for phenotyping algorithms is slow and laborious, requiring extensive and iterative involvement by domain experts. This paper introduces a method to develop phenotyping algorithms in an unbiased manner...
متن کاملAutomated disease cohort selection using word embeddings from Electronic Health Records.
Accurate and robust cohort definition is critical to biomedical discovery using Electronic Health Records (EHR). Similar to prospective study designs, high quality EHR-based research requires rigorous selection criteria to designate case/control status particular to each disease. Electronic phenotyping algorithms, which are manually built and validated per disease, have been successful in filli...
متن کاملIdentifiable Phenotyping using Constrained Non-Negative Matrix Factorization
This work proposes a new algorithm for automated and simultaneous phenotyping of multiple co–occurring medical conditions, also referred as comorbidities, using clinical notes from the electronic health records (EHRs). A basic latent factor estimation technique of non-negative matrix factorization (NMF) is augmented with domain specific constraints to obtain sparse latent factors that are ancho...
متن کاملبررسی استانداردهای ساختار، محتوا و واژهنامه پرونده الکترونیک سلامت در سازمانهای منتخب و ارائه الگوی مناسب برای ایران
Introduction: Electronic health record (EHR) is defined as digitally stored healthcare information about an individual's life time with the purpose of supporting continuity of care, education, and research. Major issue that needs to be addressed in order to accomplish with sharing and exchange is the development and use of content and structure standards in the EHR. Based on, this investigation...
متن کاملAberration of Erythrocyte Sedimentation Rate by Zinc Nanoparticles
Dear Editor, erythrocyte sedimentation rate (ESR) is a useful basic clinical pathology laboratory investigation. It can be helpful in diagnosis and follow-up of several diseases. At present, a new automated method with proven reliability is available for ESR test (1). Here, the authors report on observation on a laboratory experiment to test the effect of zinc nanoparticles on ESR results. The...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 2015 شماره
صفحات -
تاریخ انتشار 2015